Tight Linear Convergence Rate Bounds for
Douglas-Rachford Splitting and ADMM 

Pontus Giselsson* 


Abstract —Douglas-Rachford splitting and the alternating 
direction method of multipliers (ADMM) can be used to solve 
convex optimization problems that consist of a sum of two 
functions. Convergence rate estimates for these algorithms have 
received much attention lately. In particular, linear convergence 
rates have been shown by several authors under various 
assumptions. One such set of assumptions is strong convexity 
and smoothness of one of the functions in the minimization 
problem. The authors recently provided a linear convergence 
rate bound for such problems. In this paper, we show that this 
rate bound is tight for many algorithm parameter choices. 

I. Introduction 

Douglas-Rachford splitting is an optimization algorithm
that can solve general convex composite optimization problems.
The algorithm has its roots in the 1950’s [5], [17]. In
the late 1970’s, it was shown [14] how to use the algorithm
to solve monotone operator inclusion problems and convex
composite optimization problems. The alternating direction
method of multipliers (ADMM) can also solve composite
optimization problems. It was first presented in [11], [7].
Soon thereafter, it was shown [6] that ADMM is equivalent
to Douglas-Rachford splitting applied to the dual problem.

General sublinear convergence rate estimates for these
methods have only recently been presented in the literature,
see [12], [3], [1]. Under various assumptions, linear
convergence rates can also be established. In the paper by Lions
and Mercier [14], a linear convergence rate was provided
for Douglas-Rachford splitting under (the equivalent of)
strong convexity and smoothness assumptions. Until recently,
further linear convergence rate results have been scarce. In the
last couple of years, however, several linear convergence
rate results for both Douglas-Rachford splitting and ADMM
have been presented. These include [4], [2], in which linear
convergence rates for ADMM are presented under various
assumptions. In [13], linear convergence rates are established
for multiple splitting ADMM. In [16], it is shown
that for a specific class of problems, the Douglas-Rachford
algorithm can be interpreted as a gradient method of a
function named the Douglas-Rachford envelope. By showing
strong convexity and smoothness properties of the
Douglas-Rachford envelope under similar assumptions on the underlying
problem, a linear convergence rate is established based
on gradient algorithm theory. Very recently, [15]
showed linear convergence of ADMM under smoothness
and strong convexity assumptions using the integral quadratic
constraints (IQC) framework. The rate is obtained by solving
a series of small semidefinite programs. Common to all
these linear convergence rate bounds is that they are not
tight for the class of problems under consideration, see [10,
Section IV.B].

In [18], linear convergence of ADMM is established 
under more general assumptions than the above. However, 
the assumptions are more difficult to verify for a given 
problem. Tightness is verified for a 2-dimensional example 
in the Euclidean case. In [8], linear convergence for ADMM
on strongly convex quadratic optimization problems with
inequality constraints is established. This rate improves on
the rates presented in [14], [4], [2], [13], [16], [15]. In
[9], the authors generalize, using a completely different
machinery, the results in [8], and in [10] the results are further
generalized. More specifically, [10] generalizes the results in
[8] in the following three ways: (i) a wider class of problems
is considered, (ii) rates for both Douglas-Rachford splitting 
and ADMM are provided, and (iii) the results in [10] hold for 
general real Hilbert spaces as opposed to the Euclidean space 
only in [8]. For the restricted class of problems considered 
in [8], the convergence rate bounds in [10] and [8] coincide. 

The contribution of this paper is that we show tightness of 
the convergence rate bounds presented in [10] for the class 
of problems under consideration and for many algorithm 
parameters. This is done by formulating examples, both for 
Douglas-Rachford splitting and ADMM, for which the linear 
convergence rate bounds are satisfied with equality. Similar 
lower convergence rate bounds have been presented in [15]. 
The bounds in this paper cover wider classes of problems 
and are less conservative. 

II. Notation 

We denote by $\mathbb{R}$ the set of real numbers and by $\mathbb{R}^n$ the set of real
column vectors of length $n$. Further, $\overline{\mathbb{R}} := \mathbb{R} \cup \{\infty\}$ denotes
the extended real line. Throughout this paper, $\mathcal{H}$ denotes a
real separable Hilbert space. Its inner product is denoted by
$\langle \cdot, \cdot \rangle$, the induced norm by $\|\cdot\|$, and the identity operator
by $\mathrm{Id}$. The indicator function for a set $\mathcal{X}$ is denoted by $\iota_{\mathcal{X}}$.
Finally, the class of closed, proper, and convex functions
$f : \mathcal{H} \to \overline{\mathbb{R}}$ is denoted by $\Gamma_0(\mathcal{H})$.

III. Preliminaries 

In this section we present well-known concepts, results,
operators, and algorithms that will be used extensively in the
paper.

Definition 1 (Orthonormal basis): An orthonormal basis
$\{\phi_i\}_{i=1}^K$ for a (separable) Hilbert space $\mathcal{H}$ is an orthogonal
basis, i.e. $\langle \phi_i, \phi_j \rangle = 0$ if $i \neq j$, where each basis vector has
unit length, i.e. $\|\phi_i\| = 1$.

Hereon, $\phi_i$ will denote elements of an orthonormal basis.

Remark 1: The number of elements in the basis (the
cardinality) $K$ is equal to the dimension of the corresponding
Hilbert space, which might be $\infty$. Also, by definition of a
basis, each element $x \in \mathcal{H}$ can be (uniquely) decomposed
as $x = \sum_{i=1}^K \langle x, \phi_i \rangle \phi_i$, see [20, Proposition 3.3.10].

The reason why we consider separable Hilbert spaces is
the following proposition which can be found, e.g., in [20,
Proposition 3.3.12].

Proposition 1: A Hilbert space is separable if and only if
it has an orthonormal basis.

We will also make extensive use of the following two
propositions that are proven, e.g., in [20, Proposition 3.3.10]
and [20, Proposition 3.3.14] respectively.

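Proposition 2 (Parseval’s identity): In separable Hilbert
spaces $\mathcal{H}$, the squared norm of each element $x \in \mathcal{H}$ satisfies

$$\|x\|^2 = \sum_{i=1}^K |\langle x, \phi_i \rangle|^2.$$

Proposition 3 (Riesz-Fischer): In separable Hilbert spaces
$\mathcal{H}$, the series $\sum_{i=1}^K a_i \phi_i$ converges if and only if
$\sum_{i=1}^K a_i^2 < \infty$. Then

$$\Big\| \sum_{i=1}^K a_i \phi_i \Big\|^2 = \sum_{i=1}^K a_i^2.$$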


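Definition 2 (Strong convexity): A function $f \in \Gamma_0(\mathcal{H})$ is
$\sigma$-strongly convex if

$$f(x) \geq f(y) + \langle u, x - y \rangle + \tfrac{\sigma}{2}\|x - y\|^2$$

holds for all $x, y \in \mathcal{H}$ and all $u \in \partial f(y)$.

Definition 3 (Smoothness): A function $f \in \Gamma_0(\mathcal{H})$ is
$\beta$-smooth if it is differentiable and

$$f(x) \leq f(y) + \langle \nabla f(y), x - y \rangle + \tfrac{\beta}{2}\|x - y\|^2 \tag{1}$$

holds for all $x, y \in \mathcal{H}$.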

Definition 4 (Proximal operators): The proximal operator
of a function $f \in \Gamma_0(\mathcal{H})$ is defined as

$$\mathrm{prox}_{\gamma f}(y) := \arg\min_x \left\{ f(x) + \tfrac{1}{2\gamma}\|x - y\|^2 \right\}.$$


Definition 5 (Reflected proximal operators): The
reflected proximal operator to $f \in \Gamma_0(\mathcal{H})$ is defined
as

$$R_{\gamma f} := 2\,\mathrm{prox}_{\gamma f} - \mathrm{Id}.$$
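As a concrete illustration of the proximal and reflected proximal operators, the following Python sketch evaluates both for the scalar quadratic f(x) = (lam/2) x^2, where setting the derivative of the defining objective to zero gives prox(y) = y / (1 + gamma * lam). The function names and numerical values are assumptions chosen for illustration, and the brute-force grid search is only a sanity check of the closed form.

```python
import numpy as np


def prox_quadratic(y, lam, gamma):
    """Closed-form prox of f(x) = (lam / 2) * x**2 with parameter gamma."""
    return y / (1.0 + gamma * lam)


def reflected_prox_quadratic(y, lam, gamma):
    """Reflected proximal operator R = 2 * prox - Id for the same f."""
    return 2.0 * prox_quadratic(y, lam, gamma) - y


def prox_by_grid(y, lam, gamma, half_width=5.0, num=200001):
    """Approximate the prox by minimizing f(x) + |x - y|^2 / (2 gamma) on a grid."""
    xs = np.linspace(-half_width, half_width, num)
    objective = 0.5 * lam * xs**2 + (xs - y) ** 2 / (2.0 * gamma)
    return xs[np.argmin(objective)]


lam, gamma, y = 3.0, 0.5, 2.0                 # illustrative values (assumptions)
p = prox_quadratic(y, lam, gamma)             # 2 / (1 + 1.5) = 0.8
r = reflected_prox_quadratic(y, lam, gamma)   # 2 * 0.8 - 2 = -0.4
p_grid = prox_by_grid(y, lam, gamma)          # should agree with p up to the grid spacing
```

The reflected value equals (1 - gamma * lam) / (1 + gamma * lam) * y, so its magnitude never exceeds that of y.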


Definition 6 (Fixed-point): A point $y \in \mathcal{H}$ is a fixed-point
to the (single-valued) operator $A : \mathcal{H} \to \mathcal{H}$ if

$$y = Ay.$$

The set of fixed-points to $A$ is denoted by $\mathrm{fix}A$.


Algorithm 1 (Generalized Douglas-Rachford splitting):
The generalized Douglas-Rachford splitting algorithm is
given by the iteration

$$z^{k+1} = (1 - \alpha) z^k + \alpha R_{\gamma g} R_{\gamma f} z^k \tag{2}$$

where $\alpha \in (0, 1)$ and $\gamma > 0$ are algorithm parameters.

Remark 2: In the general case, $\alpha$ is restricted to the
interval $(0, 1)$. Under the assumptions used in this paper,
a larger $\alpha$ can be used as well, see [10].
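The iteration in Algorithm 1 can be sketched in a few lines of Python. This is a minimal finite-dimensional illustration, not an implementation from [10]: the helper names are hypothetical, the proximal operators are passed in as callables that already include the parameter gamma, and the toy problem (f(x) = 0.5 * ||x - c||^2 and g = 0) is an assumption chosen because its prox is available in closed form.

```python
import numpy as np


def reflected(prox, z):
    """Reflected proximal operator R = 2 * prox - Id evaluated at z."""
    return 2.0 * prox(z) - z


def generalized_douglas_rachford(prox_f, prox_g, z0, alpha, iters):
    """Iterate z <- (1 - alpha) * z + alpha * R_g(R_f(z)) and return the final z."""
    z = np.asarray(z0, dtype=float)
    for _ in range(iters):
        z = (1.0 - alpha) * z + alpha * reflected(prox_g, reflected(prox_f, z))
    return z


def make_prox_shifted_quadratic(c, gamma):
    """Prox of gamma * f for the toy choice f(x) = 0.5 * ||x - c||^2."""
    def prox(y):
        return (y + gamma * c) / (1.0 + gamma)
    return prox


def prox_zero(y):
    """Prox of g = 0 is the identity."""
    return y


c = np.array([1.0, -2.0, 3.0])                      # minimizer of the toy problem
prox_f = make_prox_shifted_quadratic(c, gamma=1.0)
z_final = generalized_douglas_rachford(prox_f, prox_zero, np.zeros(3), alpha=0.5, iters=60)
x_final = prox_f(z_final)                           # candidate solution recovered from z
```

For this toy problem the fixed point of the iteration is z = c, and applying prox_f to it returns the minimizer c.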

IV. Linear convergence rates 

In this section, we state the linear convergence rate results
for Douglas-Rachford splitting and ADMM in [10]. The paper [10]
considers optimization problems of the form

$$\text{minimize} \quad f(x) + g(Ax) \tag{3}$$

where $x \in \mathcal{H}$, and $f$, $g$, and $A$ satisfy the following
assumptions:

Assumption 1:

(i) The function $f \in \Gamma_0(\mathcal{H})$ is $\sigma$-strongly convex and $\beta$-smooth.

(ii) The function $g \in \Gamma_0(\mathcal{K})$.

(iii) $A : \mathcal{H} \to \mathcal{K}$ is a surjective bounded linear operator.

Under the additional assumption that $A = \mathrm{Id}$ (which implies
that $\mathcal{K} = \mathcal{H}$), Douglas-Rachford splitting can be applied to
solve (3). It enjoys a linear convergence rate, as shown in
[10, Theorem 1]. This result is restated here for convenience.


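Theorem 1: Suppose that Assumption 1 holds and that
$A = \mathrm{Id}$. Then the generalized Douglas-Rachford algorithm
(Algorithm 1) converges linearly towards a fixed-point
$\bar{z} \in \mathrm{fix}(R_{\gamma f} R_{\gamma g})$ with at least rate
$|1 - \alpha| + \alpha \max\left(\frac{\gamma\beta - 1}{\gamma\beta + 1}, \frac{1 - \gamma\sigma}{1 + \gamma\sigma}\right)$, i.e.

$$\|z^{k+1} - \bar{z}\| \leq \left(|1 - \alpha| + \alpha \max\left(\tfrac{\gamma\beta - 1}{\gamma\beta + 1}, \tfrac{1 - \gamma\sigma}{1 + \gamma\sigma}\right)\right)^k \|z^0 - \bar{z}\|$$

for any $\gamma > 0$ and $\alpha \in \left(0, \frac{2}{1 + \max\left(\frac{1 - \gamma\sigma}{1 + \gamma\sigma}, \frac{\gamma\beta - 1}{1 + \gamma\beta}\right)}\right)$.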

Remark 3: The bound on the rate in Theorem 1 can be
optimized with respect to the algorithm parameters $\alpha$ and $\gamma$.
The optimal parameters are given by $\alpha = 1$ and $\gamma = \frac{1}{\sqrt{\beta\sigma}}$,
which yields rate bound factor $\frac{\sqrt{\beta/\sigma} - 1}{\sqrt{\beta/\sigma} + 1}$, see [10, Proposition 16].
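To make the parameter dependence of the rate bound concrete, the sketch below evaluates the bound of Theorem 1 as a plain Python function and checks numerically that gamma = 1/sqrt(beta * sigma) with alpha = 1 reproduces the factor (sqrt(beta/sigma) - 1) / (sqrt(beta/sigma) + 1). The function name, the constants, and the coarse scan over gamma are assumptions made for illustration only.

```python
import math


def dr_rate_bound(alpha, gamma, beta, sigma):
    """Rate bound |1 - alpha| + alpha * max(., .) from Theorem 1."""
    delta = max((gamma * beta - 1.0) / (gamma * beta + 1.0),
                (1.0 - gamma * sigma) / (1.0 + gamma * sigma))
    return abs(1.0 - alpha) + alpha * delta


beta, sigma = 10.0, 0.1                       # illustrative constants (assumptions)
gamma_opt = 1.0 / math.sqrt(beta * sigma)
root = math.sqrt(beta / sigma)
best = dr_rate_bound(1.0, gamma_opt, beta, sigma)
closed_form = (root - 1.0) / (root + 1.0)

# Coarse scan over gamma on a log scale with alpha = 1.
scan = []
for exponent in range(-20, 21):
    gamma = gamma_opt * 10.0 ** (exponent / 10.0)
    scan.append(dr_rate_bound(1.0, gamma, beta, sigma))
```

In this scan no value of gamma gives a smaller bound than gamma_opt, since the two arguments of the max move in opposite directions as gamma varies and coincide at gamma_opt.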

In the case where $A \neq \mathrm{Id}$, problem (3) can be solved by
applying Douglas-Rachford splitting on the dual problem:

$$\text{minimize} \quad d(\mu) + g^*(\mu) \tag{4}$$

where $g^* \in \Gamma_0(\mathcal{K})$, and $d \in \Gamma_0(\mathcal{K})$ is defined as

$$d := f^* \circ (-A^*).$$

If the dual problem (4) satisfies Assumption 1 (with $d$ instead
of $f$ and $g^*$ instead of $g$), Douglas-Rachford splitting can be
applied to solve (4), and Theorem 1 would guarantee a linear
convergence rate. Since $g \in \Gamma_0(\mathcal{K})$, we have $g^* \in \Gamma_0(\mathcal{K})$
[19, Theorem 12.2], and we have $A$ in Assumption 1(iii)
equal to Id in (4). The remaining assumption needed to apply
Theorem 1 is that $d \in \Gamma_0(\mathcal{K})$ is strongly convex and smooth.
Indeed, this is the case as shown in [10, Proposition 18]. This
result is restated here for convenience of the reader.

Proposition 4: Suppose that Assumption 1 holds. Then
$d \in \Gamma_0(\mathcal{K})$ is $\frac{\|A\|^2}{\sigma}$-smooth and $\frac{\theta^2}{\beta}$-strongly convex, where
$\theta > 0$ always exists and satisfies $\|A^*\mu\| \geq \theta\|\mu\|$ for all
$\mu \in \mathcal{K}$.

It is well known [6] that Douglas-Rachford splitting applied
to the dual problem (4) is equivalent to ADMM applied
to the primal problem (3). Therefore, the linear convergence
rate obtained by applying Douglas-Rachford splitting to the
dual problem (4) directly translates to a linear convergence
rate for ADMM. This linear convergence rate bound is stated
in [10, Corollary 2], and restated here for convenience.

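Proposition 5: Suppose that Assumption 1 holds and that
generalized Douglas-Rachford is applied to solve the dual
problem (4). Then the Douglas-Rachford splitting algorithm
converges linearly towards a fixed-point $\bar{z} \in \mathrm{fix}(R_{\gamma d} R_{\gamma g^*})$
with at least rate $|1 - \alpha| + \alpha \max\left(\frac{\gamma\hat{\beta} - 1}{\gamma\hat{\beta} + 1}, \frac{1 - \gamma\hat{\sigma}}{1 + \gamma\hat{\sigma}}\right)$, i.e.

$$\|z^{k+1} - \bar{z}\| \leq \left(|1 - \alpha| + \alpha \max\left(\tfrac{\gamma\hat{\beta} - 1}{\gamma\hat{\beta} + 1}, \tfrac{1 - \gamma\hat{\sigma}}{1 + \gamma\hat{\sigma}}\right)\right)^k \|z^0 - \bar{z}\|$$

for any $\gamma > 0$ and $\alpha \in \left(0, \frac{2}{1 + \max\left(\frac{1 - \gamma\hat{\sigma}}{1 + \gamma\hat{\sigma}}, \frac{\gamma\hat{\beta} - 1}{1 + \gamma\hat{\beta}}\right)}\right)$, where

$$\hat{\beta} = \frac{\|A\|^2}{\sigma}, \qquad \hat{\sigma} = \frac{\theta^2}{\beta}.$$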

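Remark 4: The parameters that optimize the convergence
rate bound are $\alpha = 1$ and $\gamma = \frac{1}{\sqrt{\hat{\beta}\hat{\sigma}}}$, and the
linear convergence rate bound factor is $\frac{\sqrt{\kappa} - 1}{\sqrt{\kappa} + 1}$, where $\kappa =
\frac{\hat{\beta}}{\hat{\sigma}} = \frac{\|A\|^2 \beta}{\theta^2 \sigma}$, see [10, Corollary 2].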
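For a finite-dimensional instance, the constants entering the ADMM rate bound can be computed from a matrix representation of A. The sketch below is an illustration under stated assumptions: A is a full-row-rank (surjective) matrix, its operator norm is its largest singular value, and the constant theta in the bound ||A^T mu|| >= theta * ||mu|| is taken as its smallest singular value. The function name and the example matrix are hypothetical, and the optimal parameters are obtained by substituting the dual smoothness and strong convexity constants into the formulas for the primal case.

```python
import numpy as np


def admm_rate_parameters(A, beta, sigma):
    """Return (beta_hat, sigma_hat) = (||A||^2 / sigma, theta^2 / beta)."""
    singular_values = np.linalg.svd(A, compute_uv=False)
    norm_A = singular_values.max()   # operator norm of A
    theta = singular_values.min()    # smallest singular value of a full-row-rank A
    return norm_A**2 / sigma, theta**2 / beta


A = np.array([[2.0, 0.0, 0.0],
              [0.0, 1.0, 0.0]])      # illustrative surjective matrix (assumption)
beta, sigma = 4.0, 1.0
beta_hat, sigma_hat = admm_rate_parameters(A, beta, sigma)

gamma_opt = 1.0 / np.sqrt(beta_hat * sigma_hat)
kappa = beta_hat / sigma_hat
rate_opt = (np.sqrt(kappa) - 1.0) / (np.sqrt(kappa) + 1.0)
```

With these numbers the dual function is 4-smooth and 0.25-strongly convex, so kappa = 16 and the optimized bound factor is 0.6.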

V. Tightness of rate bounds 

In this section, we will state examples that show tightness
of the linear convergence rate bounds in Theorem 1 and
Proposition 5 for many choices of algorithm parameters.

A. Primal Douglas-Rachford splitting 

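To establish that the convergence rate bound provided in
[10, Theorem 1] and restated in Theorem 1 is tight, we
consider a problem of the form (3) with

$$f(x) = \sum_{i=1}^K \frac{\lambda_i}{2} \langle x, \phi_i \rangle^2, \tag{5}$$

$$g(x) = 0, \tag{6}$$

$$A = \mathrm{Id}. \tag{7}$$

Here $\{\phi_i\}_{i=1}^K$ is an orthonormal basis for $\mathcal{H}$, $K$ is the
dimension of the space $\mathcal{H}$ (possibly infinite), and $\lambda_i$ is either
$\sigma$ or $\beta$. We denote the set of indices $i$ with $\lambda_i = \sigma$ by $\mathcal{I}_\sigma$
and the set of indices $i$ with $\lambda_i = \beta$ by $\mathcal{I}_\beta$. We require that
$\mathcal{I}_\sigma \neq \emptyset$, that $\mathcal{I}_\beta \neq \emptyset$, and we get that $\mathcal{I}_\sigma \cap \mathcal{I}_\beta = \emptyset$ and
$\mathcal{I}_\sigma \cup \mathcal{I}_\beta = \{1, \ldots, K\}$.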

First, we show that $f$ in (5) is defined (finite) for all $x \in \mathcal{H}$,
even if $\mathcal{H}$ is infinite dimensional. Obviously $f(x) \geq 0$ for
all $x \in \mathcal{H}$. We also have for arbitrary $x \in \mathcal{H}$ that

$$f(x) = \sum_{i=1}^K \frac{\lambda_i}{2} \langle x, \phi_i \rangle^2 \leq \frac{\beta}{2} \sum_{i=1}^K \langle x, \phi_i \rangle^2 = \frac{\beta}{2}\|x\|^2 < \infty$$

where the last equality follows from Parseval’s identity.
Therefore, the optimization problem (3) with $f$, $g$, and $A$
as in (5), (6), and (7) respectively is well defined also on
infinite dimensional spaces.

Next, we show that $f \in \Gamma_0(\mathcal{H})$ satisfies Assumption 1(i),
i.e., that $f$ is $\beta$-smooth and $\sigma$-strongly convex.

Proposition 6: The function $f$, as defined in (5) with $\lambda_i =
\sigma$ for $i \in \mathcal{I}_\sigma$ and $\lambda_i = \beta$ for $i \in \mathcal{I}_\beta$, is $\sigma$-strongly convex
and $\beta$-smooth.

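Proof. Since $\mathcal{H}$ has an orthonormal basis, each element $x \in \mathcal{H}$
may be decomposed as $x = \sum_{i=1}^K \langle x, \phi_i \rangle \phi_i$. We let $a_i =
\langle x, \phi_i \rangle$ and $b_i = \langle y, \phi_i \rangle$, to get arbitrary $x = \sum_{i=1}^K a_i \phi_i \in \mathcal{H}$
and $y = \sum_{i=1}^K b_i \phi_i \in \mathcal{H}$. Then

$$\begin{aligned}
\tfrac{\beta}{2}\|x - y\|^2 &= \tfrac{\beta}{2}\Big\| \sum_{i=1}^K a_i \phi_i - \sum_{i=1}^K b_i \phi_i \Big\|^2 \\
&= \sum_{i=1}^K \tfrac{\beta}{2}(a_i - b_i)^2 \geq \sum_{i=1}^K \tfrac{\lambda_i}{2}(a_i - b_i)^2 \\
&= \sum_{i=1}^K \lambda_i \left( \tfrac{1}{2}a_i^2 - \tfrac{1}{2}b_i^2 - \langle b_i \phi_i, a_i \phi_i - b_i \phi_i \rangle \right) \\
&= f(x) - f(y) - \sum_{i=1}^K \langle \lambda_i \langle y, \phi_i \rangle \phi_i, (a_i - b_i)\phi_i \rangle \\
&= f(x) - f(y) - \Big\langle \sum_{i=1}^K \lambda_i \langle y, \phi_i \rangle \phi_i, \sum_{i=1}^K (a_i - b_i)\phi_i \Big\rangle \\
&= f(x) - f(y) - \langle \nabla f(y), x - y \rangle
\end{aligned}$$

where the second equality follows from Riesz-Fischer, the
first inequality holds since $\beta \geq \lambda_i$ for all $i = 1, \ldots, K$, the
third equality follows by expanding the square and noting
that $a_i b_i = \langle b_i \phi_i, a_i \phi_i \rangle$ and $b_i^2 = \langle b_i \phi_i, b_i \phi_i \rangle$, the fourth
equality follows by identifying the definition of $f$ in (5)
and using $a_i = \langle x, \phi_i \rangle$ and $b_i = \langle y, \phi_i \rangle$, the fifth equality
holds since the added cross-terms vanish in the inner product
expression due to orthogonality of basis vectors $\phi_i$, and
the final equality holds by identifying $x = \sum_{i=1}^K a_i \phi_i$,
$y = \sum_{i=1}^K b_i \phi_i$, and the gradient of $f$ in (5):

$$\nabla f(x) = \sum_{i=1}^K \lambda_i \langle x, \phi_i \rangle \phi_i.$$

This is the definition of $\beta$-smoothness in Definition 3.

An equivalent derivation using $\sigma$ instead of $\beta$ and a
reversed inequality shows that $f$ is also $\sigma$-strongly convex.
□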
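The two inequalities established in the proof can be spot-checked numerically in a finite-dimensional instance. The sketch below is only an illustration: it takes the space to be R^5 with the standard basis, picks arbitrary values for sigma and beta, and samples random pairs of points. All names and values are assumptions, and a random test of this kind is a sanity check rather than a proof.

```python
import numpy as np


def f(x, lam):
    """f(x) = sum_i (lam_i / 2) * x_i**2 in the standard basis."""
    return 0.5 * np.sum(lam * x**2)


def grad_f(x, lam):
    """Gradient of f, i.e. the vector with entries lam_i * x_i."""
    return lam * x


rng = np.random.default_rng(0)
sigma, beta = 0.5, 4.0
lam = np.array([sigma, beta, sigma, beta, beta])   # each lam_i is sigma or beta

worst_smooth_gap = np.inf
worst_convex_gap = np.inf
for _ in range(1000):
    x = rng.normal(size=lam.size)
    y = rng.normal(size=lam.size)
    linear = f(y, lam) + grad_f(y, lam) @ (x - y)
    dist2 = np.sum((x - y) ** 2)
    # Smoothness: f(x) <= linear + (beta / 2) * ||x - y||^2
    worst_smooth_gap = min(worst_smooth_gap, linear + 0.5 * beta * dist2 - f(x, lam))
    # Strong convexity: f(x) >= linear + (sigma / 2) * ||x - y||^2
    worst_convex_gap = min(worst_convex_gap, f(x, lam) - linear - 0.5 * sigma * dist2)
```

Both recorded gaps should be nonnegative up to floating-point rounding.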

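To show that the provided example converges exactly with
the rate given in Theorem 1, we need expressions for the
proximal operators and reflected proximal operators of $f$ and
$g$ in (5) and (6) respectively.

Proposition 7: The proximal operator of $f$ in (5) is

$$\mathrm{prox}_{\gamma f}(y) = \sum_{i=1}^K \frac{1}{1 + \gamma\lambda_i} \langle y, \phi_i \rangle \phi_i \tag{8}$$

and the reflected proximal operator is

$$R_{\gamma f}(y) = \sum_{i=1}^K \frac{1 - \gamma\lambda_i}{1 + \gamma\lambda_i} \langle y, \phi_i \rangle \phi_i. \tag{9}$$

Proof. We decompose $x = \sum_{i=1}^K a_i \phi_i$ where $a_i = \langle x, \phi_i \rangle$
and $y = \sum_{i=1}^K b_i \phi_i$ where $b_i = \langle y, \phi_i \rangle$. Then, for general
$\gamma > 0$, the proximal operator of $f$ is given by:

$$\begin{aligned}
\mathrm{prox}_{\gamma f}(y) &= \arg\min_x \Big\{ \gamma \sum_{i=1}^K \tfrac{\lambda_i}{2} \langle x, \phi_i \rangle^2 + \tfrac{1}{2}\|x - y\|^2 \Big\} \\
&= \arg\min_{x = \sum_i a_i \phi_i} \Big\{ \sum_{i=1}^K \tfrac{\gamma\lambda_i}{2} a_i^2 + \tfrac{1}{2} \sum_{i=1}^K (a_i - b_i)^2 \Big\} \\
&= \arg\min_{x = \sum_i a_i \phi_i} \tfrac{1}{2} \sum_{i=1}^K \left( \gamma\lambda_i a_i^2 + (a_i - b_i)^2 \right) \\
&= \sum_{i=1}^K \arg\min_{a_i} \left\{ \gamma\lambda_i a_i^2 + (a_i - b_i)^2 \right\} \phi_i \\
&= \sum_{i=1}^K \frac{1}{1 + \gamma\lambda_i} b_i \phi_i = \sum_{i=1}^K \frac{1}{1 + \gamma\lambda_i} \langle y, \phi_i \rangle \phi_i.
\end{aligned}$$

The reflected resolvent for general $\gamma > 0$ is given by:

$$\begin{aligned}
R_{\gamma f}(y) &= 2\,\mathrm{prox}_{\gamma f}(y) - y \\
&= 2 \sum_{i=1}^K \frac{1}{1 + \gamma\lambda_i} b_i \phi_i - \sum_{i=1}^K b_i \phi_i \\
&= \sum_{i=1}^K \frac{1 - \gamma\lambda_i}{1 + \gamma\lambda_i} b_i \phi_i = \sum_{i=1}^K \frac{1 - \gamma\lambda_i}{1 + \gamma\lambda_i} \langle y, \phi_i \rangle \phi_i.
\end{aligned}$$

□

The proximal and reflected proximal operators of $g = 0$
are trivially given by $\mathrm{prox}_{\gamma g} = R_{\gamma g} = \mathrm{Id}$.

Next, these results are used to show a lower bound on the
convergence rate of Douglas-Rachford splitting for several
choices of algorithm parameters $\alpha$ and $\gamma$. First, we state two
helper lemmas.

Lemma 1: The function $\psi(x) = \frac{1 - x}{1 + x}$ is a decreasing
function for $x > -1$.

Proof. For $x, y > -1$ we have

$$\begin{aligned}
\frac{1 - x}{1 + x} < \frac{1 - y}{1 + y} \quad &\Leftrightarrow \quad (1 - x)(1 + y) < (1 - y)(1 + x) \\
&\Leftrightarrow \quad 2y < 2x.
\end{aligned}$$

□

Lemma 2: For $x > 0$ and $y > -1$, the function $\psi(x) = \frac{1 - x}{1 + x}$ satisfies
$\psi(x) \leq -\psi(y)$ if and only if $y \geq 1/x$.

Proof. We have

$$\begin{aligned}
\psi(x) = \frac{1 - x}{1 + x} \leq \frac{y - 1}{1 + y} = -\psi(y) \quad &\Leftrightarrow \quad (1 - x)(1 + y) \leq (y - 1)(1 + x) \\
&\Leftrightarrow \quad 2 \leq 2xy.
\end{aligned}$$

□

Theorem 2: The generalized Douglas-Rachford splitting
algorithm (Algorithm 1) when applied to solve (3) with $f$,
$g$, and $A$ in (5)-(7) converges exactly with rate

$$|1 - \alpha| + \alpha \max\left( \frac{1 - \gamma\sigma}{1 + \gamma\sigma}, \frac{\gamma\beta - 1}{1 + \gamma\beta} \right) \tag{10}$$

in the following cases: (i) $\alpha = 1$ and $\gamma \in (0, \infty)$, (ii) $\alpha \in
(0, 1]$ and $\gamma \in (0, \frac{1}{\sqrt{\beta\sigma}}]$, (iii) $\alpha \in \left[1, \frac{2}{1 + \max\left(\frac{1 - \gamma\sigma}{1 + \gamma\sigma}, \frac{\gamma\beta - 1}{1 + \gamma\beta}\right)}\right)$
and $\gamma \in [\frac{1}{\sqrt{\beta\sigma}}, \infty)$, (iv) $\alpha \in \left(0, \frac{2}{1 + \max\left(\frac{1 - \gamma\sigma}{1 + \gamma\sigma}, \frac{\gamma\beta - 1}{1 + \gamma\beta}\right)}\right)$ and
$\gamma = \frac{1}{\sqrt{\beta\sigma}}$.

Proof. For algorithm initial condition $z^0 = \phi_i$ the
Douglas-Rachford algorithm evolves according to

$$z^k = \left( 1 - \alpha + \alpha \frac{1 - \gamma\lambda_i}{1 + \gamma\lambda_i} \right)^k \phi_i$$

where $\lambda_i$ is either $\sigma$ or $\beta$ depending on if $i \in \mathcal{I}_\sigma$ or $i \in \mathcal{I}_\beta$.
This follows immediately from Algorithm 1, the expression
of $R_{\gamma f}$ in Proposition 7, and since $R_{\gamma g} = \mathrm{Id}$. Obviously,
this converges with rate factor

$$\left| 1 - \alpha + \alpha \frac{1 - \gamma\lambda_i}{1 + \gamma\lambda_i} \right|.$$

Below, we show for each of the four cases that this rate
coincides with the rate (10).
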


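Case (i): $\alpha = 1$ and $\gamma \in (0, \infty)$

The rate in this case when $z^0 = \phi_i$, $i \in \mathcal{I}_\sigma$, is exactly
$\left| \frac{1 - \gamma\sigma}{1 + \gamma\sigma} \right|$. The rate when $z^0 = \phi_i$, $i \in \mathcal{I}_\beta$, is exactly
$\left| \frac{1 - \gamma\beta}{1 + \gamma\beta} \right|$.
A lower bound on the convergence rate of the algorithm when
$\alpha = 1$ is therefore

$$\begin{aligned}
\max\left( \left| \frac{1 - \gamma\sigma}{1 + \gamma\sigma} \right|, \left| \frac{1 - \gamma\beta}{1 + \gamma\beta} \right| \right) &= \max\left( \frac{1 - \gamma\sigma}{1 + \gamma\sigma}, \frac{\gamma\beta - 1}{1 + \gamma\beta} \right) \\
&= |1 - \alpha| + \alpha \max\left( \frac{1 - \gamma\sigma}{1 + \gamma\sigma}, \frac{\gamma\beta - 1}{1 + \gamma\beta} \right)
\end{aligned}$$

where the first equality is due to Lemma 1 and the second
holds since $\alpha = 1$. This proves the first claim.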

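Case (ii): $\alpha \in (0, 1]$ and $\gamma \in (0, \frac{1}{\sqrt{\beta\sigma}}]$

The rate when using initial condition $z^0 = \phi_i$, $i \in \mathcal{I}_\sigma$, is
$r_\sigma := 1 - \alpha + \alpha\frac{1 - \gamma\sigma}{1 + \gamma\sigma}$ (since $(1 - \alpha) \geq 0$ and $\alpha\frac{1 - \gamma\sigma}{1 + \gamma\sigma} \geq 0$).
For $z^0 = \phi_i$, $i \in \mathcal{I}_\beta$, and $\gamma \leq \frac{1}{\beta}$, we get

$$\begin{aligned}
\left| 1 - \alpha + \alpha \tfrac{1 - \gamma\beta}{1 + \gamma\beta} \right| &\leq |1 - \alpha| + \alpha \left| \tfrac{1 - \gamma\beta}{1 + \gamma\beta} \right| \\
&= 1 - \alpha + \alpha \tfrac{1 - \gamma\beta}{1 + \gamma\beta} \\
&\leq 1 - \alpha + \alpha \tfrac{1 - \gamma\sigma}{1 + \gamma\sigma} = r_\sigma
\end{aligned}$$

where the last inequality holds due to Lemma 1. For $z^0 = \phi_i$,
$i \in \mathcal{I}_\beta$, and $\gamma \in [\frac{1}{\beta}, \frac{1}{\sqrt{\beta\sigma}}]$, we get

$$\begin{aligned}
\left| 1 - \alpha + \alpha \tfrac{1 - \gamma\beta}{1 + \gamma\beta} \right| &\leq |1 - \alpha| + \alpha \left| \tfrac{1 - \gamma\beta}{1 + \gamma\beta} \right| \\
&= 1 - \alpha + \alpha \tfrac{\gamma\beta - 1}{1 + \gamma\beta} \\
&\leq 1 - \alpha + \alpha \tfrac{1 - \gamma\sigma}{1 + \gamma\sigma} = r_\sigma
\end{aligned}$$

where the last inequality follows from Lemma 2. Thus, a
lower bound on the rate for $\alpha \in (0, 1]$ and $\gamma \in (0, \frac{1}{\sqrt{\beta\sigma}}]$ is

$$r_\sigma = 1 - \alpha + \alpha \frac{1 - \gamma\sigma}{1 + \gamma\sigma} = |1 - \alpha| + \alpha \max\left( \frac{1 - \gamma\sigma}{1 + \gamma\sigma}, \frac{\gamma\beta - 1}{1 + \gamma\beta} \right).$$

This proves the second claim.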


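Case (iii): $\alpha \in \left[1, \frac{2}{1 + \max\left(\frac{1 - \gamma\sigma}{1 + \gamma\sigma}, \frac{\gamma\beta - 1}{1 + \gamma\beta}\right)}\right)$ and $\gamma \in [\frac{1}{\sqrt{\beta\sigma}}, \infty)$

The rate when using $z^0 = \phi_i$, $i \in \mathcal{I}_\beta$, is $r_\beta := \alpha - 1 +
\alpha\frac{\gamma\beta - 1}{1 + \gamma\beta}$ (since $(1 - \alpha) \leq 0$ and $\alpha\frac{1 - \gamma\beta}{1 + \gamma\beta} \leq 0$). For $z^0 = \phi_i$,
$i \in \mathcal{I}_\sigma$, and $\gamma \in [\frac{1}{\sqrt{\beta\sigma}}, \frac{1}{\sigma}]$ the rate is

$$\begin{aligned}
\left| 1 - \alpha + \alpha \tfrac{1 - \gamma\sigma}{1 + \gamma\sigma} \right| &\leq |1 - \alpha| + \alpha \left| \tfrac{1 - \gamma\sigma}{1 + \gamma\sigma} \right| \\
&= \alpha - 1 + \alpha \tfrac{1 - \gamma\sigma}{1 + \gamma\sigma} \\
&\leq \alpha - 1 + \alpha \tfrac{\gamma\beta - 1}{1 + \gamma\beta} = r_\beta
\end{aligned}$$

where the last inequality follows from Lemma 2. For $z^0 =
\phi_i$, $i \in \mathcal{I}_\sigma$, and $\gamma \geq \frac{1}{\sigma}$, we get

$$\begin{aligned}
\left| 1 - \alpha + \alpha \tfrac{1 - \gamma\sigma}{1 + \gamma\sigma} \right| &\leq |1 - \alpha| + \alpha \left| \tfrac{1 - \gamma\sigma}{1 + \gamma\sigma} \right| \\
&= \alpha - 1 + \alpha \tfrac{\gamma\sigma - 1}{1 + \gamma\sigma} \\
&\leq \alpha - 1 + \alpha \tfrac{\gamma\beta - 1}{1 + \gamma\beta} = r_\beta
\end{aligned}$$

where the last inequality is due to Lemma 1. This implies that
a lower bound on the rate for $\alpha \in \left[1, \frac{2}{1 + \max\left(\frac{1 - \gamma\sigma}{1 + \gamma\sigma}, \frac{\gamma\beta - 1}{1 + \gamma\beta}\right)}\right)$
and $\gamma \in [\frac{1}{\sqrt{\beta\sigma}}, \infty)$ is

$$r_\beta = \alpha - 1 + \alpha \frac{\gamma\beta - 1}{1 + \gamma\beta} = |1 - \alpha| + \alpha \max\left( \frac{1 - \gamma\sigma}{1 + \gamma\sigma}, \frac{\gamma\beta - 1}{1 + \gamma\beta} \right).$$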

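Case (iv): $\alpha \in \left(0, \frac{2}{1 + \max\left(\frac{1 - \gamma\sigma}{1 + \gamma\sigma}, \frac{\gamma\beta - 1}{1 + \gamma\beta}\right)}\right)$ and $\gamma = \frac{1}{\sqrt{\beta\sigma}}$

This case follows directly from Cases (ii) and (iii). □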
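The four cases can be checked numerically on a small finite-dimensional instance of the example. The sketch below is an illustration under stated assumptions: the space is R^4 with the standard basis, sigma = 1 and beta = 4 (so that 1/sqrt(beta * sigma) = 0.5), and one hypothetical parameter pair (alpha, gamma) is picked for each case. Since g = 0, the reflected proximal operator of g is the identity, and the reflected proximal operator of f acts diagonally with entries psi(gamma * lambda_i).

```python
import numpy as np


def psi(x):
    """psi(x) = (1 - x) / (1 + x), the scalar factor of the reflected prox."""
    return (1.0 - x) / (1.0 + x)


def rate_bound(alpha, gamma, beta, sigma):
    """|1 - alpha| + alpha * max(psi(gamma * sigma), -psi(gamma * beta))."""
    return abs(1.0 - alpha) + alpha * max(psi(gamma * sigma), -psi(gamma * beta))


def observed_rate(alpha, gamma, lam, iters=20):
    """Start from each basis vector and record the largest per-step contraction."""
    refl = psi(gamma * lam)   # diagonal of the reflected prox of f
    worst = 0.0
    for i in range(lam.size):
        z = np.zeros(lam.size)
        z[i] = 1.0
        for _ in range(iters):
            z_next = (1.0 - alpha) * z + alpha * refl * z
            if np.linalg.norm(z_next) == 0.0:
                break         # converged in one step; nothing more to measure
            worst = max(worst, np.linalg.norm(z_next) / np.linalg.norm(z))
            z = z_next
    return worst


sigma, beta = 1.0, 4.0
lam = np.array([sigma, beta, beta, sigma])   # each lambda_i is sigma or beta
cases = {"(i)": (1.0, 0.3), "(ii)": (0.7, 0.4), "(iii)": (1.1, 2.0), "(iv)": (1.3, 0.5)}

results = {}
for name, (alpha, gamma) in cases.items():
    results[name] = (observed_rate(alpha, gamma, lam), rate_bound(alpha, gamma, beta, sigma))
    print(f"case {name}: observed {results[name][0]:.6f}, bound {results[name][1]:.6f}")
```

For each of the four parameter pairs the observed worst-case contraction factor should agree with the bound up to rounding, which is what tightness means for this example.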

The convergence rate for the example given by $f$ and $g$ in
(5) and (6) respectively coincides with the upper bound on
the convergence rate in [10, Theorem 1] (which is restated
in Theorem 1). The bound in [10, Theorem 1] is therefore
tight for the class of problems under consideration and for the
combination of algorithm parameters specified in Theorem 2.
Remark 5: The upper bound on the rate in [10, Theorem
1] relies on the triangle inequality between $(1 - \alpha)(z^k - \bar{z})$
and $\alpha(R_{\gamma g} R_{\gamma f} z^k - R_{\gamma g} R_{\gamma f} \bar{z})$. To get equality, we must find
$\alpha$, $\gamma$ and $z^k$ such that $(1 - \alpha)(z^k - \bar{z})$ and $\alpha(R_{\gamma g} R_{\gamma f} z^k -
R_{\gamma g} R_{\gamma f} \bar{z})$ are parallel. For remaining combinations of $\gamma$ and
$\alpha$, these become anti-parallel, and the rate bound is not met
exactly. Note, however, that for optimal choices of $\alpha$ and $\gamma$,
the bound is tight.
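A parameter combination outside the cases of Theorem 2 shows the gap between the exact rate of the diagonal example and the bound. The sketch below uses assumed values sigma = 1, beta = 4, alpha = 0.5 and gamma = 1, so that alpha < 1 while gamma exceeds 1/sqrt(beta * sigma) = 0.5. Because the iteration acts diagonally in the orthonormal basis, the exact rate of the example is the largest contraction factor over the basis directions.

```python
def psi(x):
    """psi(x) = (1 - x) / (1 + x)."""
    return (1.0 - x) / (1.0 + x)


sigma, beta = 1.0, 4.0
alpha, gamma = 0.5, 1.0   # alpha < 1 combined with gamma > 1 / sqrt(beta * sigma)

# Contraction factor along basis vectors with curvature sigma and beta respectively.
factor_sigma = abs(1.0 - alpha + alpha * psi(gamma * sigma))   # |0.5 + 0.5 * 0.0|
factor_beta = abs(1.0 - alpha + alpha * psi(gamma * beta))     # |0.5 - 0.5 * 0.6|
exact_rate = max(factor_sigma, factor_beta)

# Rate bound for the same parameters.
bound = abs(1.0 - alpha) + alpha * max(psi(gamma * sigma), -psi(gamma * beta))
```

Here the example converges with factor 0.5 while the bound evaluates to 0.8, so for this combination the diagonal example does not attain the bound.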


B. Dual Douglas-Rachford splitting (ADMM) 

This section concerns tightness of the rate bounds when
Douglas-Rachford splitting is applied to the dual problem
(4), or equivalently, when ADMM is applied to the primal
problem (3). To show tightness in this case, we consider
the following problem

$$f(x) = \sum_{i=1}^K \frac{\lambda_i}{2} \langle x, \phi_i \rangle^2 \tag{11}$$

$$g(x) = \iota_{x=0}(x) \tag{12}$$

$$A(x) = \sum_{i=1}^K \nu_i \langle x, \phi_i \rangle \phi_i \tag{13}$$

where $\lambda_i = \sigma$ and $\nu_i = \theta > 0$ if $i \in \mathcal{I}_\sigma$ and $\lambda_i = \beta$
and $\nu_i = \zeta \geq \theta$ if $i \in \mathcal{I}_\beta$, where $\mathcal{I}_\sigma$ and $\mathcal{I}_\beta$ are the same
as before. That $A$ is linear follows trivially. That it is
self-adjoint, bounded, and surjective is shown in the following
proposition.

Proposition 8: The linear operator $A$ defined in (13) is
self-adjoint, i.e. $A = A^*$, and for every $x \in \mathcal{H}$, we have

$$\theta\|x\| \leq \|Ax\| \leq \zeta\|x\|. \tag{14}$$

Further $\|A\| = \|A^*\| = \zeta$.

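Proof. We start by showing that $A$ is self-adjoint. We have

$$\begin{aligned}
\langle A(x), \mu \rangle &= \Big\langle \sum_{i=1}^K \nu_i \langle x, \phi_i \rangle \phi_i, \sum_{i=1}^K \langle \mu, \phi_i \rangle \phi_i \Big\rangle \\
&= \sum_{i=1}^K \nu_i \langle x, \phi_i \rangle \langle \mu, \phi_i \rangle \\
&= \Big\langle \sum_{i=1}^K \langle x, \phi_i \rangle \phi_i, \sum_{i=1}^K \nu_i \langle \mu, \phi_i \rangle \phi_i \Big\rangle \\
&= \langle x, A(\mu) \rangle
\end{aligned}$$

where the summations can be moved due to orthogonality of $\phi_i$.
Next we show the first inequality in (14):

$$\begin{aligned}
\|A(x)\| &= \Big\| \theta \sum_{i \in \mathcal{I}_\sigma} \langle x, \phi_i \rangle \phi_i + \zeta \sum_{i \in \mathcal{I}_\beta} \langle x, \phi_i \rangle \phi_i \Big\| \\
&\geq \theta \Big\| \sum_{i=1}^K \langle x, \phi_i \rangle \phi_i \Big\| = \theta\|x\|
\end{aligned}$$

since $0 < \theta \leq \zeta$. The second inequality in (14) is proven
similarly. Finally, we show $\|A\| = \zeta$. We have already shown
that $\|A(x)\| \leq \zeta\|x\|$ for all $x \in \mathcal{H}$, i.e., that $\|A\| \leq \zeta$. By
definition of the operator norm, we also know that $\|A\| \geq
\|A(x)\|$ for all $x \in \mathcal{H}$ with $\|x\| \leq 1$. Choosing $x = \phi_j$
(which satisfies $\|x\| = \|\phi_j\| = 1$) for any $j \in \mathcal{I}_\beta$ (i.e. $j$ with
$\nu_j = \zeta$) gives

$$\|A\| \geq \|A(\phi_j)\| = \Big\| \sum_{i=1}^K \nu_i \langle \phi_j, \phi_i \rangle \phi_i \Big\| = \|\zeta\phi_j\| = \zeta.$$

Thus, $\|A\| = \zeta$ and the proof is complete. □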
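The properties in Proposition 8 are easy to check numerically once the operator is written as a diagonal matrix in the basis of the phi_i. The sketch below is a finite-dimensional illustration with assumed values theta = 0.5 and zeta = 3 and a hypothetical assignment of four indices to the two index sets. It verifies symmetry, samples the ratio ||Ax|| / ||x|| at random points, and computes the spectral norm.

```python
import numpy as np

rng = np.random.default_rng(1)
theta, zeta = 0.5, 3.0                               # illustrative values with 0 < theta <= zeta
in_sigma = np.array([True, False, True, False])      # hypothetical membership in I_sigma
nu = np.where(in_sigma, theta, zeta)                 # nu_i = theta on I_sigma, zeta on I_beta
A = np.diag(nu)                                      # matrix of A in the basis {phi_i}

ratios = []
for _ in range(1000):
    x = rng.normal(size=nu.size)
    ratios.append(np.linalg.norm(A @ x) / np.linalg.norm(x))

is_self_adjoint = np.allclose(A, A.T)
op_norm = np.linalg.norm(A, 2)                       # spectral norm, i.e. largest singular value
```

All sampled ratios should lie in the interval [theta, zeta], and the spectral norm should equal zeta.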

This result implies that the assumptions in [10, Corollary
2] (and Proposition 5) are met by $f$, $g$, and $A$ in (11),
(12), and (13) respectively. The bound on the convergence
rate from [10, Corollary 2] (and restated in Proposition 5)
is therefore valid. To show that this bound is tight for the
class of problems under consideration, we need the following
explicit characterization of $d$:


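$$\begin{aligned}
d(\mu) &:= f^*(-A^*\mu) = f^*(-A\mu) \\
&= \sup_x \left\{ \langle -A\mu, x \rangle - f(x) \right\} \\
&= -\inf_x \left\{ f(x) + \langle A\mu, x \rangle \right\} \\
&= -\inf_x \Big\{ \sum_{i=1}^K \tfrac{\lambda_i}{2} \langle x, \phi_i \rangle^2 + \Big\langle \sum_{i=1}^K \nu_i \langle \mu, \phi_i \rangle \phi_i, x \Big\rangle \Big\} \\
&= -\inf_{a} \Big\{ \sum_{i=1}^K \tfrac{\lambda_i}{2} \Big\langle \sum_{j=1}^K a_j \phi_j, \phi_i \Big\rangle^2 + \Big\langle \sum_{i=1}^K \nu_i \langle \mu, \phi_i \rangle \phi_i, \sum_{j=1}^K a_j \phi_j \Big\rangle \Big\} \\
&= -\sum_{i=1}^K \inf_{a_i} \left\{ \tfrac{\lambda_i}{2} \langle a_i \phi_i, \phi_i \rangle^2 + \langle \nu_i \langle \mu, \phi_i \rangle \phi_i, a_i \phi_i \rangle \right\} \\
&= -\sum_{i=1}^K \inf_{a_i} \left\{ \tfrac{\lambda_i}{2} a_i^2 + \nu_i \langle \mu, \phi_i \rangle a_i \right\} \\
&= -\sum_{i=1}^K \left( \frac{(\nu_i \langle \mu, \phi_i \rangle)^2}{2\lambda_i} - \frac{(\nu_i \langle \mu, \phi_i \rangle)^2}{\lambda_i} \right) \\
&= \sum_{i=1}^K \frac{(\nu_i \langle \mu, \phi_i \rangle)^2}{2\lambda_i} \\
&= \sum_{i=1}^K \frac{\nu_i^2}{2\lambda_i} \langle \mu, \phi_i \rangle^2
\end{aligned}$$

where the decomposition $x = \sum_{i=1}^K a_i \phi_i$ with $a_i = \langle x, \phi_i \rangle$
is used, and the optimal $a_i = -\nu_i \langle \mu, \phi_i \rangle / \lambda_i$. The function $d$
has exactly the same structure as the function $f$ but with $\lambda_i$
in $f$ in (5) replaced by $\nu_i^2 / \lambda_i$ in $d$. The function $g^*$ is, for
all $\mu \in \mathcal{H}$, given by

$$g^*(\mu) = \sup_x \left\{ \langle \mu, x \rangle - \iota_{x=0}(x) \right\} = \langle \mu, 0 \rangle = 0.$$

This implies that the dual problem (4) with $f$, $g$, and $A$
specified in (11), (12), and (13) has exactly the same structure
as the primal problem (3) with $f$ and $g$ specified in (5)
and (6) respectively and with $A = \mathrm{Id}$. The only things
that differ are the scalars that multiply the quadratic terms
in the functions $f$ and $d$ respectively. Therefore, we can
immediately state the following corollary to Theorem 2.

Corollary 1: Let $f$ be given by (11), $g$ be given by (12),
and $A$ be given by (13). Then the generalized
Douglas-Rachford algorithm applied to solve the dual problem (4)
(or equivalently ADMM applied to solve (3)) converges as in
Theorem 2 with $\beta$ and $\sigma$ in Theorem 2 replaced by $\hat{\beta} = \frac{\zeta^2}{\sigma}$
and $\hat{\sigma} = \frac{\theta^2}{\beta}$ respectively.

The exact rate provided in Corollary 1 coincides with the rate
bound in [10, Corollary 2] and Proposition 5. Therefore,
we conclude that the rate bound in [10, Corollary 2] for
ADMM on the primal problem, or equivalently for
Douglas-Rachford splitting on the dual problem, is tight for the
class of problems under consideration for many algorithm
parameter choices. In particular, the bound is tight for the
optimal parameters $\alpha$ and $\gamma$, as in the primal
Douglas-Rachford case.

VI. Conclusion

Recent results in the literature have shown linear convergence
of Douglas-Rachford splitting and ADMM under
various assumptions. In this paper, we have shown that the
linear convergence rate bounds presented in [10] are indeed
tight for the class of problems under consideration.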